Feature selection of unbalanced breast cancer data using particle swarm optimization

نویسندگان

چکیده

<p>Breast cancer is one of the significant deaths causing diseases women around globe. Therefore, high accuracy in prediction models vital to improving patients’ treatment quality and survivability rate. In this work, we presented a new method namely improved balancing particle swarm optimization (IBPSO) algorithm predict stage breast using unbalanced surveillance epidemiology end result (USEER) data. The work contributes two directions. First, design implement an (IPSO) avoid local minima while reducing USEER data’s dimensionality. improvement comes primarily through employing cross-over ability genetic as fitness function correlation-based guide selection task minimal feature subset sufficiently describe universe. Second, develop synthetic minority over-sampling technique (ISMOTE) that overfitting problem efficiently balance USEER. ISMOTE generates objects based on average with smallest largest distance from centroid object class. experiments analysis show proposed IBPSO feasible effective, outperforms other state-of-the-art methods; minimizing features 98.45%.</p>

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Feature Selection Algorithm using Particle Swarm Optimization for Cancer Microarray Data

Microarray data are often extremely asymmetric in dimensionality, highly redundant and noisy. Most genes are believed to be uninformative with respect to studied classes. This paper proposed a novel feature selection approach for the classification of high dimensional cancer microarray data, which used filtering technique such as signal-tonoise ratio (SNR) score and optimization technique as Pa...

متن کامل

Cancer microarray data feature selection using multi-objective binary particle swarm optimization algorithm

Cancer investigations in microarray data play a major role in cancer analysis and the treatment. Cancer microarray data consists of complex gene expressed patterns of cancer. In this article, a Multi-Objective Binary Particle Swarm Optimization (MOBPSO) algorithm is proposed for analyzing cancer gene expression data. Due to its high dimensionality, a fast heuristic based pre-processing techniqu...

متن کامل

Particle Swarm Optimization based Feature Selection

Feature Selection is a pre-processing step in knowledge discovery from data (KDD) which aims at retrieving relevant data from the database beforehand. It imparts quality to the results of data mining tasks by selecting optimal feature set from larger set of features. Various feature selection techniques have been proposed in past which, unfortunately, suffer from unavoidable problems such as hi...

متن کامل

Feature Selection using Particle Swarm Optimization for Thermal Face Recognition

This paper presents an algorithm for feature selection based on particle swarm optimization (PSO) for thermal face recognition. The total algorithm goes through many steps. In the very first step, thermal human face image is preprocessed and cropping of the facial region from the entire image is done. In the next step, scale invariant feature transform (SIFT) is used to extract the features fro...

متن کامل

Adaptive feature selection using v-shaped binary particle swarm optimization

Feature selection is an important preprocessing method in machine learning and data mining. This process can be used not only to reduce the amount of data to be analyzed but also to build models with stronger interpretability based on fewer features. Traditional feature selection methods evaluate the dependency and redundancy of features separately, which leads to a lack of measurement of their...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Power Electronics and Drive Systems

سال: 2022

ISSN: ['2722-2578', '2722-256X']

DOI: https://doi.org/10.11591/ijece.v12i5.pp4951-4959